gh-111089: Add cache to PyUnicode_AsUTF8() for embedded NUL by vstinner · Pull Request #111587 · python/cpython

vstinner · 2023-11-01T02:30:10Z

Add PyASCIIObject.state.embed_null member to Python str objects. It is used as a cache by PyUnicode_AsUTF8() to only check once if a string contains a null character. Strings created by PyUnicode_FromString() initializes embed_null since the string cannot contain a null character.

Global static strings now also initialize the embed_null member. The chr(0) singleton ("\0" string) is the only static string which contains a null character.

Issue: [C API] Change PyUnicode_AsUTF8() to return NULL on embedded null characters #111089

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gh-111089: Add cache to PyUnicode_AsUTF8() for embedded NUL#111587

gh-111089: Add cache to PyUnicode_AsUTF8() for embedded NUL#111587
vstinner wants to merge 9 commits intopython:mainfrom
vstinner:unicode_embed_null

vstinner commented Nov 1, 2023 •

edited by bedevere-app bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

vstinner commented Nov 1, 2023 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

vstinner commented Nov 1, 2023 •

edited by bedevere-app bot

Loading